Document classification as a theoretical problem of documentology
نویسندگان
چکیده
The changes and amendments to the general document classification as a theoretical problem of studies documentology, are discussed. in meaning “recorded information” is subject classification. author examines faceted-block based on various characteristics united into six clusters: “Types documents physical carrier”, by acquisition circumstances”, information representational transfer tools (by signative component)”, reception (perceptive their environmental circumstances”. within each facet independent which enables characterize any parameters. made increasing number weight electronic including digital versions originally non-digital documents. refined facilitate special particular types (classes, groups) applicable scientific disciplines teaching documentological disciplines.
منابع مشابه
A New Document Embedding Method for News Classification
Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
متن کاملDocument Management as a Database Problem
1. Document Acquisition There is a broad spectrum of techniques how to acquire documents in such a way, that they are in computerreadable form and can be stored in a document base. This spectrum ranges from fully automatic at low cost via semiautomatic using tools like scanners and optical character recognition (OCR) to manual acquisition according to elaborate rules and regulations. The purpos...
متن کاملA Study on the Document Zone Content Classification Problem
A document can be divided into zones on the basis of its content. For example, a zone can be either text or non-text. Given the segmented document zones, correctly determining the zone content type is very important for the subsequent processes within any document image understanding system. This paper describes an algorithm for the determination of zone type of a given zone within an input doc...
متن کاملModeling Interestingness of Streaming Classification Rules as a Classification Problem
Inducing classification rules on domains from which information is gathered at regular periods lead the number of such classification rules to be generally so huge that selection of interesting ones among all discovered rules becomes an important task. At each period, using the newly gathered information from the domain, the new classification rules are induced. Therefore, these rules stream th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Nau?nye i tehni?eskie biblioteki
سال: 2022
ISSN: ['2686-8601', '1027-3689']
DOI: https://doi.org/10.33186/1027-3689-2022-9-147-168